Large Vocabulary Continuous Speech Recognition a Review

نویسنده

  • Steve Young
چکیده

Considerable progress has been made in speech recognition technology over the last few years and nowhere has this progress been more evident than in the area of Large Vocabulary Recognition LVR Current laboratory systems are capable of transcribing continuous speech from any speaker with average word error rates of between and If speaker adaptation is allowed then after or minutes of speech the error rate will drop well below for most speakers Hitherto LVR systems have been limited to dictation applications since they were speaker dependent and they required words to be spoken with a short pause between them The capability to recognise natural continuous speech input from any speaker however opens up many more applications and as a result LVR technology appears to be on the brink of widespread deployment across a range of Information Technology IT systems This article will discuss the principles and architecture of current LVR systems and identify the key issues a ecting their future deployment To illustrate the various points raised the Cam bridge University HTK system will be described This is a modern design giving state of the art performance and it is typical of the current generation of recognition systems

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Large Vocabulary Continuous Speech Recognition

Large vocabulary speaker-independent speech recognition systems being capable of recognizing continuous speech based on hidden Markov models are today’s standard. This review introduces the fundamentals of speech and the underlying speech recognition problems. The three classical approaches, i.e., the acoustic-phonetic, the statistical (pattern) recognition and the artificial intelligence appro...

متن کامل

Confidence measure (CM) estimation for large vocabulary speaker-independent continuous speech recognition system

In this paper we report a study for confidence measure estimation in a large vocabulary speaker-independent continuous speech recognition system. A hybrid confidence measure estimation algorithm was developed. The final confidence measure consists of a number of confidence parameters which are generated from the different processing levels of the recognition system. A Parameter Reliability Anal...

متن کامل

Two-pass Algorithm for Large Vocabulary Continuous Speech Recognition

This paper presents a two-pass algorithm for Extra Large (more than 1M words) Vocabulary COntinuous Speech recognition based on the Information Retrieval (ELVIRCOS). The principle of this approach is to decompose a recognition process into two passes where the first pass builds the word subset for the second pass recognition by using information retrieval procedure. Word graph composition for c...

متن کامل

Advances in Large Vocabulary Continuous Speech Recognition

The development of robust, accurate and efficient speech recognition systems is critical to the widespread adoption of a large number of commercial applications. These include automated customer service, broadcast news transcription and indexing, voice-activated automobile accessories, large-vocabulary voice-activated cellphone dialing, and automated directory assistance. This article provides ...

متن کامل

Extra large vocabulary continuous speech recognition algorithm based on information retrieval

This paper presents a new two-pass algorithm for Extra Large (more than 1M words) Vocabulary COntinuous Speech recognition based on the Information Retrieval (ELVIRCOS). The principle of this approach is to decompose a recognition process into two passes where the first pass builds the words subset for the second pass recognition by using information retrieval procedure. Word graph composition ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004